>HEM1
GTTTCTTCGAATTCGCGGCCGCTTCTAGAGAATTTTATTATATAGTTTAAGGGATAATATTTTATTAATATTTTTTTTATTTATTTATTTAATTATATTATATATATAATATATATATAACAATAAATTTATGCAACGTTCTATATTCGCCAGATTTGGTAACAGTAGTGCTGCTGTGTCCACTTTAAACAGACTGTCTACGACGGCTGCGCCTCATGCTAAAAACGGCTACGCAACTGCTACTGGCGCCGGTGCAGCAGCAGCTACAGCAACCGCATCCTCTACACATGCAGCTGCCGCCGCAGCCGCCGCTGCCAACCATTCTACACAAGAATCTGGCTTCGATTATGAGGGACTAATTGATTCAGAATTGCAGAAAAAGCGTTTGGACAAATCTTATAGATACTTTAACAACATTAACAGGCTGGCCAAAGAGTTCCCATTAGCCCATAGACAGAGAGAAGCCGATAAGGTTACGGTCTGGTGCTCAAATGATTATCTTGCTTTATCCAAACATCCAGAGGTGCTTGATGCTATGCACAAAACAATCGATAAATACGGATGCGGTGCTGGTGGTACTCGTAACATTGCCGGACATAACATACCAACCTTGAACTTAGAAGCAGAATTGGCTACGTTACACAAAAAGGAGGGTGCTTTGGTATTTTCTTCATGTTATGTGGCCAATGATGCAGTTCTGAGCTTATTGGGCCAGAAGATGAAAGATCTAGTGATCTTCAGCGATGAACTAAACCACGCCTCCATGATTGTTGGGATCAAACATGCAAATGTTAAGAAACACATTTTTAAGCACAATGATCTGAATGAGCTTGAACAACTGTTACAATCATATCCCAAATCAGTACCTAAATTGATTGCTTTTGAATCTGTGTATTCGATGGCGGGCAGTGTTGCGGATATAGAAAAGATTTGCGACCTTGCCGATAAGTATGGTGCTCTAACCTTTTTAGATGAAGTTCATGCCGTTGGTTTGTACGGCCCTCACGGCGCTGGTGTTGCCGAACATTGTGACTTTGAATCACATAGAGCTTCCGGTATTGCGACGCCAAAAACTAATGATAAGGGTGGTGCTAAAACTGTCATGGATAGAGTGGATATGATTACTGGTACATTAGGGAAATCCTTTGGCTCTGTAGGAGGATATGTTGCGGCGTCAAGAAAACTTATTGACTGGTTTAGATCTTTCGCCCCAGGTTTCATTTTTACAACAACTTTGCCCCCATCAGTGATGGCAGGGGCCACAGCCGCCATAAGATATCAAAGGTGTCATATTGATCTACGTACATCTCAACAAAAGCACACTATGTACGTCAAAAAGGCATTTCATGAGCTGGGTATTCCTGTGATCCCGAATCCTTCTCATATTGTACCTGTGTTGATAGGTAACGCTGATCTAGCAAAACAAGCCTCCGATATTCTGATCAACAAGCATCAAATTTATGTCCAAGCAATCAACTTTCCAACAGTTGCTAGAGGAACTGAAAGGCTGAGAATAACGCCTACTCCAGGTCACACTAACGATTTGAGTGATATTTTGATTAATGCCGTGGATGATGTCTTTAACGAATTGCAATTACCCAGAGTTAGAGACTGGGAGAGTCAAGGTGGTCTTCTGGGGGTGGGAGAAAGCGGCTTTGTTGAAGAATCTAATTTATGGACGAGCAGTCAATTATCCTTGACTAACGATGACCTGAATCCTAATGTCAGAGATCCGATCGTAAAGCAATTAGAAGTTTCGTCTGGGATAAAACAGTATCCATACGATGTACCCGATTACGCGTGAAATTATATATCTAAATGATTAATATATATATTATTAATAATTAACAATAATTAATATATTATAATTTATATATATATATTTTATATTATTATTACTAGTAGCGGCCGCTGCAGGAAGAAAC

Translation Map
New AA Segment
     1 ATGCAACGTTCTATATTCGCCAGATTTGGTAACAGTAGTGCTGCTGTGTCCACTTTAAAC
     1  M  Q  R  S  I  F  A  R  F  G  N  S  S  A  A  V  S  T  L  N 
    61 AGACTGTCTACGACGGCTGCGCCTCATGCTAAAAACGGCTACGCAACTGCTACTGGCGCC
    21  R  L  S  T  T  A  A  P  H  A  K  N  G  Y  A  T  A  T  G  A 
   121 GGTGCAGCAGCAGCTACAGCAACCGCATCCTCTACACATGCAGCTGCCGCCGCAGCCGCC
    41  G  A  A  A  A  T  A  T  A  S  S  T  H  A  A  A  A  A  A  A 
   181 GCTGCCAACCATTCTACACAAGAATCTGGCTTCGATTATGAGGGACTAATTGATTCAGAA
    61  A  A  N  H  S  T  Q  E  S  G  F  D  Y  E  G  L  I  D  S  E 
   241 TTGCAGAAAAAGCGTTTGGACAAATCTTATAGATACTTTAACAACATTAACAGGCTGGCC
    81  L  Q  K  K  R  L  D  K  S  Y  R  Y  F  N  N  I  N  R  L  A 
   301 AAAGAGTTCCCATTAGCCCATAGACAGAGAGAAGCCGATAAGGTTACGGTCTGGTGCTCA
   101  K  E  F  P  L  A  H  R  Q  R  E  A  D  K  V  T  V  W  C  S 
   361 AATGATTATCTTGCTTTATCCAAACATCCAGAGGTGCTTGATGCTATGCACAAAACAATC
   121  N  D  Y  L  A  L  S  K  H  P  E  V  L  D  A  M  H  K  T  I 
   421 GATAAATACGGATGCGGTGCTGGTGGTACTCGTAACATTGCCGGACATAACATACCAACC
   141  D  K  Y  G  C  G  A  G  G  T  R  N  I  A  G  H  N  I  P  T 
   481 TTGAACTTAGAAGCAGAATTGGCTACGTTACACAAAAAGGAGGGTGCTTTGGTATTTTCT
   161  L  N  L  E  A  E  L  A  T  L  H  K  K  E  G  A  L  V  F  S 
   541 TCATGTTATGTGGCCAATGATGCAGTTCTGAGCTTATTGGGCCAGAAGATGAAAGATCTA
   181  S  C  Y  V  A  N  D  A  V  L  S  L  L  G  Q  K  M  K  D  L 
   601 GTGATCTTCAGCGATGAACTAAACCACGCCTCCATGATTGTTGGGATCAAACATGCAAAT
   201  V  I  F  S  D  E  L  N  H  A  S  M  I  V  G  I  K  H  A  N 
   661 GTTAAGAAACACATTTTTAAGCACAATGATCTGAATGAGCTTGAACAACTGTTACAATCA
   221  V  K  K  H  I  F  K  H  N  D  L  N  E  L  E  Q  L  L  Q  S 
   721 TATCCCAAATCAGTACCTAAATTGATTGCTTTTGAATCTGTGTATTCGATGGCGGGCAGT
   241  Y  P  K  S  V  P  K  L  I  A  F  E  S  V  Y  S  M  A  G  S 
   781 GTTGCGGATATAGAAAAGATTTGCGACCTTGCCGATAAGTATGGTGCTCTAACCTTTTTA
   261  V  A  D  I  E  K  I  C  D  L  A  D  K  Y  G  A  L  T  F  L 
   841 GATGAAGTTCATGCCGTTGGTTTGTACGGCCCTCACGGCGCTGGTGTTGCCGAACATTGT
   281  D  E  V  H  A  V  G  L  Y  G  P  H  G  A  G  V  A  E  H  C 
   901 GACTTTGAATCACATAGAGCTTCCGGTATTGCGACGCCAAAAACTAATGATAAGGGTGGT
   301  D  F  E  S  H  R  A  S  G  I  A  T  P  K  T  N  D  K  G  G 
   961 GCTAAAACTGTCATGGATAGAGTGGATATGATTACTGGTACATTAGGGAAATCCTTTGGC
   321  A  K  T  V  M  D  R  V  D  M  I  T  G  T  L  G  K  S  F  G 
  1021 TCTGTAGGAGGATATGTTGCGGCGTCAAGAAAACTTATTGACTGGTTTAGATCTTTCGCC
   341  S  V  G  G  Y  V  A  A  S  R  K  L  I  D  W  F  R  S  F  A 
  1081 CCAGGTTTCATTTTTACAACAACTTTGCCCCCATCAGTGATGGCAGGGGCCACAGCCGCC
   361  P  G  F  I  F  T  T  T  L  P  P  S  V  M  A  G  A  T  A  A 
  1141 ATAAGATATCAAAGGTGTCATATTGATCTACGTACATCTCAACAAAAGCACACTATGTAC
   381  I  R  Y  Q  R  C  H  I  D  L  R  T  S  Q  Q  K  H  T  M  Y 
  1201 GTCAAAAAGGCATTTCATGAGCTGGGTATTCCTGTGATCCCGAATCCTTCTCATATTGTA
   401  V  K  K  A  F  H  E  L  G  I  P  V  I  P  N  P  S  H  I  V 
  1261 CCTGTGTTGATAGGTAACGCTGATCTAGCAAAACAAGCCTCCGATATTCTGATCAACAAG
   421  P  V  L  I  G  N  A  D  L  A  K  Q  A  S  D  I  L  I  N  K 
  1321 CATCAAATTTATGTCCAAGCAATCAACTTTCCAACAGTTGCTAGAGGAACTGAAAGGCTG
   441  H  Q  I  Y  V  Q  A  I  N  F  P  T  V  A  R  G  T  E  R  L 
  1381 AGAATAACGCCTACTCCAGGTCACACTAACGATTTGAGTGATATTTTGATTAATGCCGTG
   461  R  I  T  P  T  P  G  H  T  N  D  L  S  D  I  L  I  N  A  V 
  1441 GATGATGTCTTTAACGAATTGCAATTACCCAGAGTTAGAGACTGGGAGAGTCAAGGTGGT
   481  D  D  V  F  N  E  L  Q  L  P  R  V  R  D  W  E  S  Q  G  G 
  1501 CTTCTGGGGGTGGGAGAAAGCGGCTTTGTTGAAGAATCTAATTTATGGACGAGCAGTCAA
   501  L  L  G  V  G  E  S  G  F  V  E  E  S  N  L  W  T  S  S  Q 
  1561 TTATCCTTGACTAACGATGACCTGAATCCTAATGTCAGAGATCCGATCGTAAAGCAATTA
   521  L  S  L  T  N  D  D  L  N  P  N  V  R  D  P  I  V  K  Q  L 
  1621 GAAGTTTCGTCTGGGATAAAACAGTATCCATACGATGTACCCGATTACGCG
   541  E  V  S  S  G  I  K  Q  Y  P  Y  D  V  P  D  Y  A 
Stop
     1 TGA
     1  * 
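
The translation map above can be re-derived with a short script. The following is a minimal sketch, not the tool that produced this report: it assumes the standard genetic code, and the names BASES, AMINO_ACIDS, CODON_TABLE and translate are illustrative only.

```python
# Standard genetic code (an assumption), with codons enumerated in
# T, C, A, G order at each of the three positions.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def translate(orf: str) -> str:
    """Translate an in-frame DNA string into one-letter amino acids.

    Stop codons are returned as '*', matching the Stop entry above.
    """
    orf = orf.upper()
    if len(orf) % 3 != 0:
        raise ValueError("ORF length must be a multiple of 3")
    return "".join(CODON_TABLE[orf[i:i + 3]] for i in range(0, len(orf), 3))
```

Applied to the first seven codons of the segment, translate("ATGCAACGTTCTATATTCGCC") gives the same residues as the first row of the map (M Q R S I F A).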

Restriction Sites
Name	Seq.	Locations
AatI	AGGCCT	none
AccI	GTMKAC	195
AflII	CTTAAG	none
AgeI	ACCGGT	none
AlwI	GGATC	773, 1365(c), 1729(c)
AlwNI	CAGNNNCTG	none
ApaI	GGGCCC	none
ApaLI	GTGCAC	none
AscI	GGCGCGCC	none
AseI	ATTAAT	63, 1558, 1821, 1836, 1853
AvaI	CYCGRG	none
AvaII	GGWCC	none
AvrII	CCTAGG	none
BamHI	GGATCC	none
BbsI	GAAGAC	1628(c)
BbvI	GCAGC	253, 259, 289, 301, 169(c), 205(c), 292(c), 310(c), 1910(c)
BclI	TGATCA	1439
BglI	GCCNNNNNGGC	998
BglII	AGATCT	723, 1198
BlpI	GCTNAGC	none
BsaI	GGTCTC	none
BsmAI	GTCTC	1607(c)
BsmBI	CGTCTC	none
BstEII	GGTNACC	none
BstXI	CCANNNNNNTGG	none
ClaI	ATCGAT	547
DraIII	CACNNNGTG	none
EagI	CGGCCG	15, 1905
EarI	CTCTTC	none
EcoRI	GAATTC	8
EcoRV	GATATC	1274
FokI	GGATG	559, 1569, 275(c), 514(c)
FseI	GGCCGGCC	none
HindIII	AAGCTT	none
KasI	GGCGCC	244
KpnI	GGTACC	none
MluI	ACGCGT	1796
NarI	GGCGCC	244
NcoI	CCATGG	none
NdeI	CATATG	none
NheI	GCTAGC	none
NotI	GCGGCCGC	14, 1904
NsiI	ATGCAT	none
PacI	TTAATTAA	none
PciI	ACATGT	none
PmeI	GTTTAAAC	none
PstI	CTGCAG	1911
PvuI	CGATCG	1733
PvuII	CAGCTG	290
SacI	GAGCTC	none
SacII	CCGCGG	none
SalI	GTCGAC	none
SapI	GCTCTTC	none
SfiI	GGCCNNNNNGGCC	none
SgrAI	CRCCGGYG	246
SmaI	CCCGGG	none
SpeI	ACTAGT	1897
SphI	GCATGC	none
SspI	AATATT	55, 66
StuI	AGGCCT	none
SwaI	ATTTAAAT	none
TliI	CTCGAG	none
XbaI	TCTAGA	23
XhoI	CTCGAG	none
XmaI	CCCGGG	none
XmnI	GAANNNNTTC	none
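
The forward-strand entries in the table above can be spot-checked with a regular-expression search. This is a minimal sketch under stated assumptions: the IUPAC dictionary and the find_sites function are illustrative names, only the strand as written is searched (the entries marked (c) are not reproduced), and positions are reported as 0-based offsets of the first base of the site, which is the convention the first few table entries appear to follow (EcoRI at 8, NotI at 14, XbaI at 23).

```python
import re

# IUPAC nucleotide codes expanded to regex character classes.
IUPAC = {
    "A": "A", "C": "C", "G": "G", "T": "T",
    "R": "[AG]", "Y": "[CT]", "S": "[CG]", "W": "[AT]",
    "K": "[GT]", "M": "[AC]",
    "B": "[CGT]", "D": "[AGT]", "H": "[ACT]", "V": "[ACG]",
    "N": "[ACGT]",
}


def find_sites(seq: str, site: str) -> list[int]:
    """Return 0-based start offsets of every match of `site` in `seq`.

    `site` may contain IUPAC ambiguity codes. A lookahead is used so
    that overlapping matches are all reported. Forward strand only.
    """
    pattern = "".join(IUPAC[base] for base in site.upper())
    return [m.start() for m in re.finditer("(?=" + pattern + ")", seq.upper())]
```

For example, on the first 30 bases of HEM1 (GTTTCTTCGAATTCGCGGCCGCTTCTAGAG), find_sites returns [8] for GAATTC and [23] for TCTAGA, in agreement with the EcoRI and XbaI rows.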

Codon Usage Table
AmAcid	Codon	Number	/1000	Fraction

END	TAA	0	0.0	0.0
END	TGA	1	1.79	1.0
END	TAG	0	0.0	0.0

ALA	GCT	20	35.84	0.31
ALA	GCA	15	26.88	0.23
ALA	GCC	22	39.42	0.34
ALA	GCG	7	12.54	0.10

CYS	TGT	3	5.37	0.5
CYS	TGC	3	5.37	0.5

ASP	GAT	27	48.38	0.81
ASP	GAC	6	10.75	0.18

GLU	GAA	18	32.25	0.72
GLU	GAG	7	12.54	0.28

PHE	TTT	14	25.08	0.7
PHE	TTC	6	10.75	0.3

GLY	GGT	19	34.05	0.47
GLY	GGA	7	12.54	0.17
GLY	GGC	9	16.12	0.22
GLY	GGG	5	8.96	0.12

HIS	CAT	14	25.08	0.63
HIS	CAC	8	14.33	0.36

ILE	ATT	18	32.25	0.56
ILE	ATA	7	12.54	0.21
ILE	ATC	7	12.54	0.21

LYS	AAA	20	35.84	0.60
LYS	AAG	13	23.29	0.39

LEU	TTG	14	25.08	0.28
LEU	TTA	13	23.29	0.26
LEU	CTA	6	10.75	0.12
LEU	CTT	6	10.75	0.12
LEU	CTG	10	17.92	0.20
LEU	CTC	0	0.0	0.0

MET	ATG	9	16.12	1.0

ASN	AAT	11	19.71	0.39
ASN	AAC	17	30.46	0.60

PRO	CCA	9	16.12	0.39
PRO	CCT	8	14.33	0.34
PRO	CCC	4	7.16	0.17
PRO	CCG	2	3.58	0.08

GLN	CAA	14	25.08	0.77
GLN	CAG	4	7.16	0.22

ARG	AGA	15	26.88	0.68
ARG	AGG	3	5.37	0.13
ARG	CGT	4	7.16	0.18
ARG	CGA	0	0.0	0.0
ARG	CGC	0	0.0	0.0
ARG	CGG	0	0.0	0.0

SER	TCT	14	25.08	0.33
SER	TCA	8	14.33	0.19
SER	AGT	6	10.75	0.14
SER	TCC	8	14.33	0.19
SER	AGC	4	7.16	0.09
SER	TCG	2	3.58	0.04

THR	ACT	13	23.29	0.39
THR	ACA	10	17.92	0.30
THR	ACC	3	5.37	0.09
THR	ACG	7	12.54	0.21

VAL	GTT	13	23.29	0.36
VAL	GTA	6	10.75	0.16
VAL	GTC	6	10.75	0.16
VAL	GTG	11	19.71	0.30

TRP	TGG	4	7.16	1.0

TYR	TAT	11	19.71	0.61
TYR	TAC	7	12.54	0.38
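
The three numeric columns above can be recomputed from the coding sequence. The sketch below is an illustration under assumptions, not the report generator's code: it assumes the standard genetic code, takes Number as the raw codon count, /1000 as the count per thousand codons of the ORF including the stop codon (20/558 x 1000 = 35.84 for GCT is consistent with this), and Fraction as the share among synonymous codons. The function name codon_usage is hypothetical. The listed values look truncated rather than rounded to two decimals (HIS CAT, 14/22 = 0.636, is listed as 0.63), so the sketch returns unrounded floats.

```python
from collections import Counter

# Standard genetic code (an assumption), codons in T, C, A, G order.
BASES = "TCAG"
AMINO_ACIDS = "FFLLSSSSYY**CC*WLLLLPPPPHHQQRRRRIIIMTTTTNNKKSSRRVVVVAAAADDEEGGGG"
CODON_TABLE = {
    a + b + c: AMINO_ACIDS[16 * i + 4 * j + k]
    for i, a in enumerate(BASES)
    for j, b in enumerate(BASES)
    for k, c in enumerate(BASES)
}


def codon_usage(orf: str) -> dict[str, tuple[int, float, float]]:
    """Map each codon present in `orf` to (number, per_1000, fraction).

    per_1000 is relative to all codons in the ORF; fraction is relative
    to all codons encoding the same amino acid (or stop).
    """
    orf = orf.upper()
    usable = len(orf) - len(orf) % 3  # ignore any trailing partial codon
    codons = [orf[i:i + 3] for i in range(0, usable, 3)]
    counts = Counter(codons)
    total = len(codons)
    per_amino_acid = Counter()
    for codon, n in counts.items():
        per_amino_acid[CODON_TABLE[codon]] += n
    return {
        codon: (n, 1000.0 * n / total, n / per_amino_acid[CODON_TABLE[codon]])
        for codon, n in counts.items()
    }
```

Codons that never occur (CTC, CGA and so on in the table above) are simply absent from the returned dictionary; a caller wanting explicit zero rows would need to add them.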

GC Percentage: 40.0%
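
A GC percentage like the one above is the share of G and C bases in a sequence. The sketch below shows the arithmetic only; the report does not state whether the figure refers to the full HEM1 construct or to the coding segment alone, so the input is left to the caller, and the name gc_percent is illustrative.

```python
def gc_percent(seq: str) -> float:
    """Return the percentage of bases in `seq` that are G or C."""
    seq = seq.upper()
    if not seq:
        raise ValueError("sequence is empty")
    gc = seq.count("G") + seq.count("C")
    return 100.0 * gc / len(seq)
```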


